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BACKGROUND OF THE INVENTION 

The invention disclosed and claimed herein generally pertains to a method for 
compression of video signal data. More particularly, the invention pertains to a method of the 
above type wherein the video data is collectively represented by arrays of coded macroblocks 
and skipped (or non-coded) macroblocks. Even more particularly, the invention pertains to a 
method of the above type wherein macroblocks are grouped into slices, and run-length coding 
is used to indicate the number of skipped macroblocks therein. 

Digital video signals, in non-compressed form, typically contain large amounts of 
data. However, the actual necessary information content is considerably smaller due to high 
temporal and spatial correlations. Accordingly, video compression, or coding, is used to 
reduce the amount of data which is actually required for certain tasks, such as storage of the 
video signals or for transmitting them from one location to another. In the coding process 
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temporal redundancy can be used by making so-called motion-compensated predictions, 
where regions of a video frame are predicted from similar regions of a previous frame. That 
is, there may be parts of a frame that do not contain any, or hardly any, change at all from 
corresponding parts of the previous frame. Such regions can thus be skipped or non-coded, 
in order to maximize compression efficiency. On the other hand, if a good match with a 
previous frame cannot be found, predictions within a frame can be used to reduce spatial 
redundancy. With a successful prediction scheme, the prediction error will be small and the 
amount of information that has to be coded greatly reduced. Moreover, by transforming 
pixels to a frequency domain, e.g., by using the discrete cosine transform, spatial correlations 
provide further gains in efficiency. Prior art techniques of video compression are described, 
for example, in U.S. Patent No. 5,903,313, issued May 11, 1999. 

Notwithstanding the benefits of video data compression, the coded bitstream, i.e., 
compressed data which is transmitted from one location to another, may become corrupted 
due to error-prone transmission channels. If this happens, relying too much on prediction 
may cause large potential damage such as propagating errors and severely decreasing video 
quality. In the past, common techniques of reducing such damage have included preventing 
temporal predictions at certain times, and constraining spatial predictions within bounded 
segments of a video frame, or picture. The overall quality of the decoded video signal 
(following compression and transmission) is thus a trade-off between high compression on 
the one hand, and error resilience on the other. However, there are potential conflicts 
between using flexible and error robust segmentation, and an effective signaling of skipped or 
non-coded regions of a video data frame. Prior art methods handle these conflicts by either 
restricting segmentation to rigid areas, with a loss of flexibility and efficiency, or by 
enforcing coding of regions within a segment that in principle do not need to be coded. 
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Herein, the terms "picture" and "frame" are used interchangeably to refer to a frame 
of image data in a video sequence. 

In order to limit spatial error propagation, different methods or techniques of 
segmenting a picture are available. Two standard methods are provided by ITU-T 
Recommendations H.261 and H.262, respectively. In Recommendation H.261, each frame of 
a video sequence is divided into a number of segments called Groups of Blocks (GOBs). 
Each GOB contains 33 macroblocks arranged in 3 rows by 1 1 columns, and each macroblock 
covers an area of 16 x 16 pixels. Accordingly, the two picture formats supported by H.261, 
the Common Intermediate Format (CIF) containing 352 x 288 pixels, and the Quarter-CIF 
(QCIF) containing 176 x 144 pixels, are formed by combining 12 and 3 non-overlapping 
GOBs, respectively, in order to cover the entire picture. 

Within a GOB macroblocks are positioned row by row and left to right, with the first 
block at the top left corner and the 33 rd at the bottom right corner. However, only those 
macroblocks that differ from the previously coded picture (if there was a previously coded 
picture) need to be coded. Thus, a macroblock address is transmitted before each coded 
macroblock to signal its position within the GOB. For the first coded macroblock in a GOB, 
the address is the macroblock position in the GOB. For subsequent macroblocks, the address 
is the increment of position from the previously coded macroblock. The macroblock 
addresses are given by the variable-length code words shown in Figure 1 . Macroblocks that 
are not addressed should be skipped and copied from the previous picture. That is, the video 
data for a skipped macroblock is the data from the corresponding macroblock of the previous 
picture. It is to be noted that although there is no restriction on which or how many 
macroblocks need to be coded, a GOB always refers to 33 macroblocks, whether coded or 
not. The positions of the beginning and end of each GOB are defined by the H.261 standard 
and are signaled in the bitstream by a GOB number in each GOB header. 
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A more flexible picture segmentation technique can be found in the MPEG-1 and 
MPEG-2 video standards, ISO/IEC 1 1 172-2 and ISO/IEC 13818-2, respectively. The latter is 
also jointly published by ITU-T as Recommendation H.262. In accordance therewith, a 
picture is segmented into slices which differ from the above GOBs used in Recommendation 

5 H.261 . Each slice comprises an arbitrary number of macroblocks, arranged consecutively in 
a sequence or series, wherein the ordering of macroblocks is referenced to an entire picture 
starting at the top left corner. Each slice must be at least one macroblock long and may not 
overlap other slices. MPEG-1 allows slices of arbitrary length, but does not allow gaps 

pt between slices. MPEG-2, on the other hand, constrains slices to fit within one row of 

Wsl-" 

1§| macroblocks but allows gaps in its general slice structure. Here the occurrence of a gap 
Pi between slices informs the decoder implicitly that the corresponding macroblocks are 

u 

* skipped, and that the data for these skipped macroblocks should be copied from the pixel 

f^. - values of the corresponding macroblocks of the previous picture. 

BJ 

Sj ■ : One important restriction on both MPEG-1 and MPEG-2 slices is that the first and the 

Hi 

15 last macroblocks included in a slice must be coded, i.e., a slice may not start or end with a 
skipped macroblock. Because of this requirement it is always clear where the previous slice 
ended when a new slice starts, that is, the previous slice ended right after the last coded 
macroblock was transmitted. Except for the first and last macroblocks, a slice may contain 
skipped macroblocks that do not have to be transmitted. Accordingly, the position of each 

20 coded macroblock within a slice is signaled in the bitstream. This is done in the same manner 
as for the GOB arrangement of Recommendation H.261. The relative position of each 
macroblock is referred to as the macroblock address increment, and is given by the 
codewords of Figure L In order to allow more than 32 consecutive skipped macroblocks in a 
slice, a macroblock escape codeword is included as well. One ore more escape codewords 
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can be inserted before the actual macroblock address increment, each indicating an additional 
increment of 33 macroblocks. 

The freedom to use a variable number of macroblocks in a segment makes it possible 
to keep the segment within a certain bit size. This is very useful when segments are 
packetized for transmission in an IP (Internet Protocol) network, and particularly important 
for wireless applications where packet sizes are small. Furthermore, by adjusting the size of 
segments it is possible to adjust for various transmission environments. In a lossy channel, 
for instance, compression efficiency can be traded for better video quality using smaller 
segments. The bit-error rate can thus be used to find an optimal segment size. As for 
Recommendation H.261 described above, segment boundaries for the GOBs are pre-defined. 
Hence, the size of a GOB is difficult to control, making these segments very inflexible. To 
this end slices in MPEG-1 and MPEG-2 are more versatile. However, since slice boundaries 
are signaled by ensuring that the last macroblock of a slice is always coded, an efficient 
compression and optimal slice size can be difficult to achieve within these standards. The 
reason is that one may be forced to code a macroblock, that otherwise could have been 
skipped, or alternatively deviate from the desired slice size. These disadvantages are further 
emphasized with reference to Figure 2. 

Figure 2 shows a row 10 of macroblocks in a QCIF picture, which is otherwise not 
shown. The macroblocks may be encoded at a first location, transmitted to a decoder at a 
second location, and decoded thereby to recover the original video data. The original data is 
typically recovered with a quality degradation depending on how hard the compression is 
done. Figure 2 further shows the end portion of a slice 12, including at least coded 
macroblocks 12a and 12b, the beginning portion of a slice 14, including at least a coded 
macroblock 14b, and skipped macroblocks 16a-f positioned between the coded macroblocks 
12a-c and 14a-b. The skipped macroblocks between coded macroblocks 12c and 14a may 
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create problems in segmenting the macroblocks shown in Figure 2, if segmentation must be 
done in accordance with the MPEG-2 standard and if it is desired to place a slice boundary 
between B and C, in order to form slices 12 and 14 so that each slice could be placed into a 
separate IP packet of specified maximum size. The location of the slice boundary is even 
more important if slices 12 and 14 are to precisely fill their IP packets to maximum capacity. 
If the MPEG-2 standard is applied to the prior art arrangement shown in Figure 2, three 
alternatives are possible, each having attendant disadvantages. 

In the first alternative, the slice 12 would end at B in Figure 2, and slice 14 would 
start at C. The skipped macroblocks 16a-f between B and C would be implicitly considered 
as skipped by the decoder, since no data would be transmitted for the skipped blocks. 
However, if the compressed data was transmitted through a lossy channel, such as a wireless 
communication channel, the IP packet starting at C and thus containing slice 14 could be lost 
and thus not received at the decoder. In this event, all of the information regarding the 
skipped macroblocks 16a-f would also be lost. In particular, the decoder would not be able to 
distinguish skipped macroblocks from lost macroblocks. 

In the second alternative, the slice boundary would be located at either B or C, 
without a gap between slices 12 and 14. Accordingly, if the boundary was placed at B in 
Figure 2, the last macroblock of slice 12 would be coded macroblock 12c but the first 
macroblock of slice 14 would be skipped macroblock 16a. On the other hand, if the 
boundary was placed at C, the first macroblock of slice 14 would be coded macroblock 14a 
but the last macroblock of slice 12 would be skipped macroblock 16f. As stated above, 
neither arrangement would be allowed, since the MPEG-2 standard does not allow skipped 
macroblocks at the beginning or end of a slice. Therefore, it would be necessary to 
additionally code one of the skipped macroblocks, either skipped macroblock 16a or 16f. 
Moreover, the skipped macroblock would have to be coded with zero (absolute) motion 
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vector and no coefficients. In this manner, the skipped macroblock information between B 
and C could be sent. However, the number of additional bits which would have to be 
transmitted as a result of coding the skipped macroblock could be quite significant, 
particularly if the predicted motion vector is large. 

In the third alternative, the slice boundary would be placed at either position A or D in 
Figure 2. This avoids the respective drawbacks discussed above in connection with the first 
and second alternatives. However, as stated above, it is desired to send the data of respective 

|| slices in IP packets, with one slice per packet and respective packets having a fixed maximum 

•N - 

p bit size. Clearly, every packet should be filled with as many bits as possible, in order to fully 

m 

Wo utilize the network capacity. Accordingly, placing the boundary at A could be sub-optimal, if 

slice 12 thereby became too small to use all the bit capacity of a packet. Placing the 
l| boundary at D would not be possible, since slice 12 would become so large that it would 

f% exceed the maximum packet size. 

fe- 
ll-: 

SUMMARY OF THE INVENTION 

15 The invention pertains to a method of video compression wherein a frame of video 

data is segmented into slices. The slices are allowed to contain an arbitrary number of 
consecutive macroblocks, starting and ending at arbitrary positions within a picture and 
thereby giving full freedom to start and end slices at any position. Since slices can contain 
variable numbers of macroblocks, embodiments of the invention can be readily adapted to 

20 provide a coding standard for different coding environments. 

One embodiment of the invention, directed to a method for compressing a frame of 
video image data, includes the steps of dividing the frame into a number of blocks comprising 
coded blocks and skipped blocks, and consecutively grouping the blocks into a sequence of 
slices. The method further includes the step of inserting a run-length codeword after the final 
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coded block in a slice to indicate the number of skipped blocks which are positioned between 
the final coded block and the end of the slice. Thus, in one aspect of the invention, non- 
coded regions of a frame of video data are effectively signaled using run-length coding. 
Consecutive skipped blocks are grouped together and encoded as a single symbol, referred to 
herein as a run-length codeword to indicate the number of skipped blocks. By signaling all 
non-coded regions within a segment of a frame, it is possible to efficiently take advantage of 
a flexible picture segmentation, and also to effectively conceal damaged picture areas. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a table showing codewords for indicating the address of a macroblock 
relative to a previous macroblock. 

Figure 2 is a schematic diagram showing a row of macroblocks in a QCIF image for 
illustrating disadvantages of the prior art. 

Figure 3 is a schematic diagram showing a row of macroblocks to be compressed or 
encoded in accordance with an embodiment of the invention. 

Figure 4 is a schematic diagram showing the macroblock row of Figure 3 after 
compression. 

Figure 5 is a table showing run-length codewords for use in an embodiment of the 
invention, and more particularly shows the first ten variable codewords of an infinite table. 

Figure 6 is a schematic diagram showing a row of macroblocks encoded in 
accordance with a modification of the invention. 

Figure 7 is a schematic diagram showing a simplified system for compressing, 
transmitting, and decoding video data in accordance with an embodiment of the invention. 

DETAILED DESCRIPTION OF EXEMPLARY EMBODIMENT 
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Referring to Figure 3, there is shown a row of macroblocks 18 to be compressed in 
accordance with an embodiment of the invention, which eliminates or substantially reduces 
problems of the type described above in connection with the prior art. Row 18, comprising 
macroblocks 18a- 18k, is similar to row 10 of Figure 2 and is associated with motion- 
compensated digital video compression, whereby respective frames are divided into 
macroblocks, each macroblock having a luma resolution of 16 x 16 pixels. Macroblocks 18a- 
18c and 18j-18k are coded macroblocks encoded in accordance with conventional practice so 
that motion compensation information is associated therewith, and macroblocks 18d-18i are 
skipped macroblocks as described above. 

In accordance with the invention, the blocks 18a-18k are to be segmented into slices, 
wherein slices are allowed to contain an arbitrary number of consecutive macroblocks and 
can start and end at arbitrary positions within the associated frame. Accordingly, full 
freedom is given to start and end slices at any position . In view of this freedom, the 
macroblocks shown in Figure 3 are segmented into a first slice 20 and a second slice 22 by 
locating a slice boundary at C. It is assumed that this will provide slices 20 and 22 which are 
both sized to fill corresponding IP packets on an optimal basis, that is, each packet will be 
filled to maximum capacity. Thus, slice 20 ends with macroblocks 18a-18i and slice 22 
begins with macroblocks 18j and 18k. More specifically, slice 20 ends with six skipped 
macroblocks 18d-18i. 

Referring to Figure 4, there is shown slices 20 and 22 and their respective 
macroblocks after compression in accordance with the invention. In particular, Figure 4 
shows a run-length codeword 24 included in slice 20, immediately after the last coded 
macroblock thereof, i.e. after macroblock 18c. Codeword 24 defines the end of slice 20, and 
has an index indicating the number of non-coded or skipped macroblocks which follow the 
last coded macroblock in slice 20, prior to the next slice. Codeword 24, which need be no 
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more than a few bits, thus indicates to the decoder that the six skipped macroblocks 18d-18i 
follow coded macroblock 18c within slice 20. Accordingly information regarding the 
skipped macroblocks will reach the decoder even if the next slice, slice 22, is lost in 
transmission. This robustness against error is provided at low overhead cost, since codeword 
24 need not be more than a few bits in length. Thus, it is seen that an embodiment of the 
invention enables the sizes of respective slices to be readily adapted to match the sizes of IP 
packets. This is especially useful in wireless applications where the packets are relatively 
small. At the same time, the compression of skipped macroblocks at the end of a slice is very 
error resilient. 

While not shown, each compressed slice or group includes a header or a start code. 
In one embodiment of the invention, the run-length codeword 24 could comprise a 



ll codeword associated with one of the macroblock addresses shown in Figure 1 . In this 

y 

C5 embodiment coded macroblocks are addressed using a run-length coding of preceding 

O 

W skipped macroblocks, similar to the addressing of GOBs in H.261, as described above. 
1 5 However, the codeword 24, instead of addressing a coded macroblock, would signal the 

number of skipped macroblocks between the last coded macroblock and the end of the slice. 

In order to account for the occurrence of zero skipped blocks, the number selected from 

Figure 1 would be equal to the number of skipped blocks plus one. Thus, to indicate the six 

skipped blocks 18d-18i, the number 7 would be selected from Figure 1, and the codeword 24 
20 therefore would be 00010. Since Figure 1 only discloses 33 addresses, an escape codeword 

could be used to indicate an additional increment of 33 skipped macroblocks, in analogy with 

the MPEG-1 and MPEG-2 standards described above. 

In a second embodiment of the invention, run-length codeword 24 may be generated 

from an infinite variable-length code. A code of this type would eliminate need for the 
25 escape codeword described above. One very useful infinite variable-length code is illustrated 

DALLAS2 826086v6 34645-00532USPT \ Q 



PATENT APPLICATION 
Atty. Docket: 34645-532USPT 

P14487 

in Figure 5. In a shorthand notation, codewords of the infinite code are generated according 
to the following pattern: 1; 0 x 1; 0 x 0 x 1; 0 x 0 x Ox 1; 0 x 0 x 0 x 0 x 1..., where eachx 
can be either 0 or 1 . 

Figure 5 further shows a symbol, referred to as SKIPRUN, which is a single symbol 
representing the run-length encoding of consecutive skipped blocks. Thus, for the six skipped 
macroblocks 18d-18i shown in Figure 3, SKIPRUN would be 6 and run-length codeword 24, 
from Figure 5, would be 0101 1. In this embodiment SKIPRUN is always transmitted at the 
beginning and at the end of a slice group, and is also inserted between any two adjacent 
coded macroblocks, to indicate the number of skipped macroblocks therebetween. 

In a modification of the invention, the run-length codeword 24 is not used to signal 
zero skipped macroblocks, in the event there are no skipped macroblocks following the last 
coded macroblock, before the end of the slice. In this modification, the decoder would 
simply recognize that the absence of the run-length codeword between slices indicated zero 
skipped macroblocks therebetween. Accordingly, the coding arrangements shown in Figures 
1 and 5 could be shifted. Thus, the six skipped macroblocks 18d-18i, in the arrangement of 
Figure 1 , would be indicated by the codeword 0001 1 . In the arrangement of Figure 5, the six 
skipped macroblocks would be represented by the codeword 01001 . 

Referring to Figure 6, there is shown another modification, wherein slices 20 and 22 
are overlapped. This is achieved by ending first slice 20 at boundary C, and starting the 
second slice 22 at boundary B, as respectively shown in Figure 3. As a result, slice 20 ends 
with run-length codeword 24 as described above, to indicate the six skipped macroblocks 
18d-18i. In addition, slice 22 is provided with a codeword 26 to indicate the six preceding 
macroblocks 18d-18i. Thus, information regarding skipped macroblocks 18d-18i will be 
received as long as one of the slices 20 or 22 is received, even if the other is lost. While not 
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shown, slices 20 and 22 could be allowed to overlap other types of macroblocks besides 
skipped macroblocks. 

In a further modification, syntax could be defined to allow slices that only contain 
non-coded or skipped macroblocks. One such slice could be indicated by a single run-length 
codeword, covering the entire slice and containing no macroblock data. 

Referring to Figure 7 there is shown a source 28 of video information such as a video 
camera. The video information is coupled to a video encoder 30, which compresses 
successive frames of data in accordance with an embodiment of the invention, as described 
herein. A bitstream comprising the compressed data is transmitted through a communication 
channel, which may be a wireless communication channel, from a transmitter 32 to a receiver 
34. The received data is applied to a decoder 36 to recover the video information. In its 
operation, the decoder copies pixels corresponding to skipped macroblocks from a previously 
received picture. 

Obviously, many other modifications and variations of the present invention are 
possible in light of the above teachings. It is therefore to be understood that within the scope 
of the disclosed concept, the invention may be practiced otherwise than as has been 
specifically described. 
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